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(54) Expression vectors for the bovine trypsin and trypsinogen and host cells transformed therewith. 

(57) DNA sequences encoding bovine trypsin and bovine trypsinogen are provided as are recombinant 
DNA vectors comprising these sequences. Host cells transformed with the trypsin and trypsinogen 
expression vectors are disclosed in context of means for producing bovine trypsin and bovine 
trypsinogen. 
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i rypsin is a protease wmcn c:eaves to trie carooxyi side of lysine and argenine residues. Trycsin is pro- 
cucea ;n the form of a precursor or zymogen mo;ecuie cai.ea tripsinogen, 'rypsinogen is convertea to trypsin 
by the action of enteropepticase 

The substrate specificity of trypsin prov.aes a useful enzyme for conversion of biosyntheticaiiy produced 

5 molecules to preferred molecules. An examoie is the conversion of proinsuhn to insulin via trypsin mediated 
removal of the connecting pept de. Trypsin is commercially available and is produced primarily by isolation from 
the pituitary glands of a variety of species Bovine and porcine pancreases are particularly common sources 
of trypsin. Purification procedures utilized to purify trypsin for later use in byconversion processes aim to re- 
move undesirable copunfymg proteases from the desired trypsin product. 

'0 Notwithstanding much effort at purification, various lots of trypsin contain variable amounts of contamin- 

ating proteases. Chymotrypsm is frequently present in minimal amounts in trypsin production lots. The pres- 
ence of even a minor amount of a contaminating protease results in undesirable cleavage of various products 
when only the trypsin mediated cleavage is desired. Conversion of proinsuhn to insulin via the action of trypsin 
is thus complicated by contaminants of other proteases. The present invention solves the problem of contam- 

15 mating protease contaminations Dy providing recombinant DNA expression systems for the biosynthetic pro- 
duction of bovine trypsin and trypsinogen. Thus, the present invention represents a significant advance in the 
art of typsm and trypsinogen production tnereby greatly facilitating bio-conversion of precursor molecules. 

The present invention discloses and claims DNA sequences which encode bovine trypsin and trypsinogen. 
Expression vectors useful for producing trypsin and trypsinogen are also disclosed and claimed as are host 

20 cells transformed with these expression vectors. The expression vectors and host cells of the present invention 
provide a convenient source for trypsin and trypsinogen molecules, free of contaminating proteases which 
disrupt biosynthetic conversation processes. 

A series of figures are provided to further understanding of the invention. Figure 1 is a restriction site and 
function map of plasmid pRMG4 Figure 2 is a restriction site and function map of plasmid pRMG5. Figure 3 is 

25 a restriction site and function map of plasmid pRMG6. Figure 4 is a restriction site and function map of plasmid 
pRMG7. Figure 5 is a restriction site and function map of plsmid pHKY390. 

The ability to produce trypsin either by direct expression or by production of the zymogen, trypsinogen 
affords flexibility in the isolation, purification and folding of trypsin by allowing the initial steps of trypsin pro- 
duction to be performed on an enzymatically inactive form. 

30 The expression vectors provided by the instant invention were prepared by replacing the kanamycin phos- 

photransferase coding region of olasmid pHKY390 with chemically synthesized double-stranded DNA encod- 
ing trypsin or trypsinogen. Plasmid pHKY390 was deposited with the Northern Regional Research Laboratory 
(N.R.R.L.), Peoria, il_ USA on January 17. 1992, where it is available under the accession number NRRL B- 
18885. Ptasmid pHKY390 was deposited m the E. coii host strain RV308. 

35 The chemically synthesized genes encoding trypsin and trypsinogen were prepared on an Applied Bio- 

systems DNA synthesizer using ^-cyanoethyl phosphoramidite chemistry. A series of 20 oligonucleotides was 
synthesized as described in Example 1 The appropriate oligonucleotides were then annealed and ligated to 
generate double stranded DNA molecules encoding bovine trypsin and bovine trypsinogen. The double strand- 
ed DNA sequence which was orepared to encode bovine trypsin is provided below as Formula 1 . The amino 

•*o acid sequence encoded by the corresponding DNA is provided below the oligonucleotide sequence. Sequence 
I.D. 21 , which is provided in a later section of this disclosure, corresponds to the sense strand of the sequence 
provided in Formula I. Sequence I D. 22 corresponds to the amino acid sequence of Formula 1. The oligonu- 
cleotide sequences, which flank the coding sequence, are designated by lower case letters and the stop co- 
don, TAG is designated as END n the ammo acid sequence provided below the oligonucleotide sequence of 

45 Formula 1 . 
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Formula I 



N N 

e r 
I I 

catATGATCGTTGGCGOCTACACCTGTGGCGCCAATACCGTCCCGTACCAC^TGTCCCTG 
1 + ♦ + +■ + + 6 

& TACTAGCAACCGCCGATGTGGACACCGCGGTTATGGCAGGGCATGGTCCACAGGGAC 
MetlleValGlyGlyTyrThrCysGlyAlaAsnThrValProTyrGlnValSerLeu - 

AATTCTGGCTACCACTTCTGTXXJTGGC^^ 
61 + + + ^ + 

TT AAG AC CG ATGGTG AAG AC AC C AC CGAGGGAGTAGTTGAGGGTC AC CC AC CAT AG TCGC 
AsnSerGlyTyrHisPheCysGlyGlySerLeuIleAsnSerGlnTrpValValSerAla - 



20 

GCCCACTGCTACAAGTCCGGCATCCAGGTGCGTCTGGGCGAGGATAACATCAACGTCGTG 
121 + + + + ^ 

a CGGGTGACGATGTTCAC^CCGTAC^TCCACGCAGACCCGCTCCTATTGTAGTTGCAGCAC 
AlaHisCysTyrLysSerGlylleGlnValArgLeuGlyGluAspAsnlleAsnValVdl 

25 



A 
P 

30 a 

L 
I 

GAGGGCAATGAGC AGTTCATCTCCGCATCCAAGTCCATCGTGCACCCGTCC^ACAACTCC 



35 



40 



45 



181 

CTCCCGTTACTCGTCAAGTAGAGGCGTAGGTTCAGGTAGCACGTGGGCA^ATC^AGG 
GluGlyAsnGluGlnPhelleSerAlaSerLysSerlleValHisProSerTyrAsnSer - 

AACACTCTGAACAATGACATCATCCTGATCAAGCTCAAGTCCGCCGCATCCCTGAACTCC 
241 + + + + + 300 

TTGTGAGACTTGTTACTGTAGTACGACTAGTTCGAGTTCAGGCGGCGTAGGGACTTGAGG 
AsnThrLeuAsnAsnAspIleMetLeuIleLysLeuLysSerAlaAlaSerLeuAsnSer - 

CGCGTGGCCTCCATCTCTCTGCCGACCTCCTGTGCCTCCGCCGGCACGCAGTGCCTCATC 
301 + + + + ^ 36Q 

GCGC AC CGG AGGT AG AG AG ACGGCTGGAGG A CACGGAGGCGGCCGTGCGTC AC GGAGT AG 
ArgValAlaSerlleSerLeuProThrSerCysAlaSerAlaGlyThrGlnCysLeuIle - 

TCTGGCTGGGGCAACACTAAGAGCTCTGGCACCTCCTACCCAGACGTGCTGAAGTGCCTG 
361 + + + + + 42Q 

AGACCGACCCCGTTGTGATTCTCGAGACCGTGGAGGATGGGTCTGCACGACTTCACGGAC 
SerGlyTrpGlyAsnThrLysSerSerGlyThrSerTyrProAspValLeuLysCi'sLeu - 
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75 



20 



B 
a 

1 



421 



AAGGCTCCTATCCTGA3CGAT^CCTCC-rGTAAGTCCGCCTACCCTOGC 



:agattaccac 



: -AGGATAGGACTC3CTAAGGAGGACATTCAGGCGGATGGGACCGGTCTAATGGTCG 
LysAlaPrcIleLeuSerAspSerSerCysLysSerAlaTiTProGlyGlnrieThrSer 



4 30 



AACATG A . CTGTCCCGGCTACCTGGAGGGCGGCAAGGAT^CCT^T^ 
■* o 1 - - _ + — — _ _ + _ _ _ _ 

yTCTACAAGACACGGCCGATGCACCT^ 540 

AsnMetPheCysAlaGlyTyrLeuGluGlyGlyLysAspSerCysGlnGlyAspSerGly - 



541 



C^CCCTGTGGTCTGCTCCGGCAAGCTCCA^ 

^CGGG AC ACC AG ACGAGGCCGTTCGAGGTTCCGTAGC AG AGG AC CC 6 ° ° 

GlyProValValCysSerGlyLysLeuGlnGlylleValSerTrpGlySerGlyCysAla - 



_ C AGAAGAAC AAGCCTGGCGTCTAC ACC AAGGTCTGTAACTATGTGTCCTGGATTAAGCAG 

GTCTTCTTGTTCGGACCGCAGATGra^ ° 
GlnLysAsnLysProGlyValTyrThrLysValCysAsnTyrValSerTrpIleLysGln - 



660 



25 B 

a 
m 
H 
I 

ACCATAGCTTCC AATt agga tec 

+ * 683 

TGGTATCGAAGGTTAa t cc t agg 
Thr IleAlaSerAsnEnd 



30 6 61 



The double stranded sequence encoding bovine trypsin is provided to add detail to the single stranded 
35 format required in the Sequence Identification section of this disclosure. Restriction endonuclease recognition 
sites are provided above the sequence as appropriate; the amino acid encoded by each codon is presented 
below the DNA sequence; and the nucleotides forming the flanking regions of the coding region are provided 
to illustrate via restriction endonuclease recognition sites and linkers the manner whereby the coding sequence 
was inserted into the expression vectors. 
40 The DNA sequence synthesized to comprise a bovine trypsinogen encoding region is provided below as 

Formula II. The format is similar to that provided above for the region encoding bovine trypsin(Formula I). Se- 
quence I.D. 23, which is provided in a later section of this disclosure, corresponds to the coding sequence of 
Formula II while Sequence I.D. 24 provides the amino acid sequence encoded thereby. The oligonucleotide 
sequences, which flank the coding sequence, are designated by lower case letters and the stop codon.TAG 
-*5 is designated as END in the ammo acid sequence provided below the oligonucleotide sequence of Formula 



50 



55 



4 



EP 0 597 681 A1 



10 



15 



20 



25 



30 



35 



40 



45 



50 



Formula I I 



N 



M 



d 

e r 
I I 
ca t ATGGT3GATGA7X;ATK^ATAAGATCGTT^ 

1 , + + + + ^ eo 

gtaTACCACCTACTACTACTATTCTAGCAACCGCCGATGTGGACACCGCGGTTATGGCAG 
MetValAspAspAspAspLysI leValGlyGlyTyrThrCysGlyAlaAsnThrVal - 



CCGTACCAGGTGTCCCTGAATTCT^CTACCACTTCTGTGGTGGCTCCCTCATCAACTCC 
61 + + + + 

GGCATGGTCCACAt^GGACTTAAGACCGATGGTGAAGACACCACCGAGGGAGTAGTTGAGG 
ProTyrGlnValSerLeuAsnSerGlyTyrHisPheCysGlyGlySerLeuIleAsnSer - 

cagtg<x;tggtatcagcggcccactgctacaagtccggcatccaggtgcgtctgggcgag 

121 + + . + 18Q 

gtcacccaccatagtcgccgggtgacgatgttcaggccgtaggtccacgcagacccgctc 

GlnTrpValVa lS^rAlaAlaHi sCysTyrLysSerGly IleGlnValArgLeuGlyGlu - 
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10 



15 



A 

P 
a 
L 

GATAAC AT3AACGTCGTGGAGGGCAAT^A3C AGTTCATCTCCGCATCCAAGTCCATCGTG 
181 «. + ^ ^ 24Q 

CTATTGTAGTTGC A3C ACCTCCC3TTACTCGTCAAGTAGAGGCGTAGGTTCAGGTAGC AC 
AspAsnlleAsnValValGluGiyAsnGiuGlnPhelieSerAlaSerLysSerlieVal - 

CACCCGTCCTAC AACTCC AACACTCTGAAC AATGACATCATGCTGATCAAGCTCAAGTCC 
241 + + + + + ^ 30Q 

GTGGGCAGGATGTTGAGGTTGTGAGACTTGTTACTGTAGT^^ 

HisProSerTyrAsnSerAsnThrLeuAsnAsnAspIleMetLeuIleLysLeuLysSer - 

GCCGCATCCCTGAACTCCCGCGTGGCCTCCATCTCTCTGCCGACCTC^TGTGC^T^^GCC 

301 + «. + ^ ^ 360 

CGGCGTAGGGACTTGAGGGCGCACCGGAGGTAGAGAGACGGCTGGAGGACACGGAGGCGG 
AlaAlaSerLeuAsnSerArgVaiAUSerl leSerLeuProThrSerC/sAlaSerAU - 



GGCACGCAGTGCCTC ATCTCTGGCTGGGGC AAC ACTAAGAGCTCTGGCACCTCCTACCCA 
361 + + + + ^ ^ 420 

CCGTGCGTCACGGAGTAGAGACCGACCCCGTTGTGATTCTCGAGACCGTGGAGGATGGGT 
GlyThrGlnCysLeuIleSerGlyTrpGlyAsnThrLysSerSerGlyThrSerTyrPro - 

GACGTGCTGAAGTGCCTGAAGGCTCCTATCCTGAGCGATTCCTCCTGTAAGTCCGCCTAC 
421 * . + + ^ ^ 480 

CTGCACGACTTCAC-GGACTTCCGAGGATAGGACTCGCTAAGGAGGACATTCAGGCGGATG 
AspValLeuLysC\'sLeuLysAlaProUeLeuSerAspSerSerCysLysSerAlaTyr - 

B 
a 
1 
I 

GCTGGCC AGATTACCAGCAACATGTTCTGTGCCGGCTACCTGGAGGGCGGCAAGGA'TTCC 
481 . . + , ^ 54Q 

CX3ACCGGTCTAATG<3TrGTTGTACAAGACACGGCCGATGGACCTCCCGCCGTTCCTAAGG 
ProGlyGlnlleThrSerAsnMetPheCysAlaGlyTyrLeuGluGlyGlyLysAspSer - 

TGTCAGGGTGATTCTOGTGGCCCTX^TGGTCTGCTCCGGCAAGCTCCAAGGCATCGTCTCC 
541 . + + ^ + 6Q0 

ACAGTCCCACTAAGACCACCGGGACACCAGACGAGGCCGTTCGAGGTTCCGTAGCAGAGG 
CysGlnGlyAspSerGlyGlyProValValCysSerGlyLysLeuGlnGlylleValSer - 

TGGGGTTCCGGCTGTGCCCAGAAGAACAAGCCTGGCGTCTACACCAAGGTCTGTAACTAT 
45 601 + , + + + ^ ^ 6Q 

ACCCCAAGGCCGACACGGGTCTTCTTGTTCGGACCGCAGATGTGGTTCCAGACATTGATA 
TrpGlySerGlyCysAlaGlnLysAsnLysProGlyValTyrThrLysValCysAsnTyr 



25 



30 



35 



40 



50 



B 
a 
m 
H 



GTGTCCTGGATTAAGC AGACCATAGCTTCGAATt aggat cc 

55 66 1 * + * + - *701 

CACAGGACCTAATTCGTCTOGTATCGAAGGTTAatcctagg 
ValSerTrpI leLysGlnThr I leAlaSerAsnEnd 
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The gene for bovine trypsin was prepared by assembling subsets of the oligonucleotides described in Ex- 
ample 1 into three separate cassettes prior to combining the three cassettes to form the full length bovine tryp- 
sin encoding gene. Oligonucleotides BT1-6 were annealed and inserted into the commercially available vector, 
pBluescnpt SK+ (Stratagene). Oligonucleotide sequences BT7-12 were likewise annealed and inserted into a 

5 pBluescnpt SK+ cloning vector. The third cassette was generated upon ligation of oligonucleotides BT13-18 
and insertion into a third pBluescnpt SK+ cloning vector. The three cassettes encoding portions of the bovine 
trypsin gene each have a Hind III termini and an Xba l termini. The bovine trypsin encoding sequence was syn- 
thesized as three separate components to minimize the chance for spontaneous mutations occurring within 
the sequence. The cloning vector comprising oligonucleotides BT1-6 is designated pRMG1 . The cloning vector 

10 comprising oligonucleotides BT7-12 is designated plasmid pRMG2. The cloning vector comprising oligonucleo- 
tides BT13-18 is designated pRMG3. The three portions of the bovine trypsin encoding sequence were pre- 
pared by digesting plasmids pRMG1 , pRMG2, and pRMG3 with appropriate endonucleases followed by ligation 
of the fragments and insertion into an expression vector. The expression vector utilized in the construction of 
trypsin and trypsinogen expression vectors is designated plasmid pHKY390. Plasmid pHKY390 has been de- 

15 posited in the Northern Regional Research Laboratory, Peoria, IL where it is publicly available under the ac- 
cession number B-1885. 

A restriction site and function map of plasmid pHKY390 is provided in Figure 5. Plasmid pHKY390 was orig- 
inally used as a promoter probe wherein promoters were evaluated for their ability to cause transcription of 
the kanamycin phosphotransferase gene of plasmid pHKY390. Reference to Figure 5 reveals that an IMde l and 

20 Bam HI site are conveniently located in plasmid pHKY390 for insertion of a sequence encoding a polypeptide 
product of interest . Plasmid pRMG4 was constructed by insertion of the trypsin encoding gene into the 
Nde l/ Bam HI digested plasmid pHKY390. The three fragments which upon ligation generate the trypsin encod- 
ing gene were prepared as described in Example 4. A restriction site and function map of plasmid pRMG4 is 
provided in Figure 1. Plasmid pRMG4 utilizes a modified lambda pL promoter, p97, to drive transcription of a 

25 two cistron message wherein the second cistron encodes bovine trypsin. Plasmid pRMG4 uses a tetracyline 
resistance gene as a selectable marker. The temperature sensitive lambda pL repressor, c1857, is utilized to 
provide regulatable transcription from the modified lambda promoter. The origin of replication utilized in plasmid 
pRMG4 was prepared originally from plasmid pBR322. Plasmid pRMG4 also utilizes a rop gene. The rop gene 
provides a vector copy number of approximately fifteen to twenty when utilized, as in the vectors of the present 

30 invention, with a pBR322-derived origin of replication. 

Plasmid pRMG7 is the preferred expression vector for bovine trypsinogen. Reference to Figures 1 and 4 
and the examples indicates the high level of similarity between the preferred expression vectors for bovine 
trypsin and bovine trypsinogen. Accordingly the description of the elements in plasmid pRMG4 is likewise ap- 
plicable to plasmid pRMG7. 

35 A variety of E. coli host cells were utilized in the construction of the vectors and expression systems of 

the present invention. E. coli RV308 is available from the Northern Regional Research Laboratory, Peoria. IL 
(NRRL) under the accession number NRRL B-15624E. coli MM294 is available from the American Tissue Cul- 
ture Collection, Parklawn , MD (ATCC) under the accession number ATCC 31446. The inability of either of 
these strains to support expression of bovine trypsin or bovine trypsinogen from plasmids pRMG4 and pRMG7 

40 respectively underscores the unpredictability, which remains in the art of molecular biology. The reason or rea- 
sons why such well recognized E. coli host strains were incapable of achieving expression of trypsin and tryp- 
sinogen remains unelucidated. Digestion of either the messenger RNA or the desired protein product could 
account for the failure to affect expression in these strains. E. coli L687, a /on- host cell, was eventually tried 
and this host cell strain proved to be competent for expression of bovine trypsin and bovine trypsinogen from 

45 pRMG4 and pRMG7 respectively. E. coli L687 was deposited in the NRRL where it is available under the ac- 
cession number B-1 8884. Accordingly, E. coli L687 transformed with plasmids pRMG4 and pRMG7 comprise 
the respective best modes for producing bovine trypsinogen and bovine trypsin in prokaryottc cells. The media 
utilized in the fermentative production of the enzyme and zymogen of the present invention affect the overall 
production levels of the desired products. L-broth is the preferred media for such fermentation processes. The 

50 components of L-broth are 1 % (w/v) Bacto tryptone; 0.5% (w/v) Yeast extract; 0.5% (w/v) NaCI; and 0.1 % (w/v) 
dextrose at pH 7.0. L-agar is L-broth solidified with 1.5% (w/v) Bacto agar. 

The expression products of plasmids pRMG4 and pRMG7 have been established by conventional biochem- 
ical methodologies to be bovine trypsin and trypsinogen respectively. The availability of trypsin, whether ex- 
pressed directly or converted from its zymogen precursor, provides a signif igant advantage in biochemical con- 

55 version processes such as the removal of the connecting peptide of insulin. The source of enzyme devoid of 
contaminating proteases allows substantially greater flexibility in the production of important medicinal poly- 
peptides such as insulin. The biosynthetic source of the enzyme also eliminates any concerns related to the 
use of enzymes prepared from animal sources in the production of molecules which will be administered to 
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humans or animals. 

The examples which follow are intended to further illustrate the present invention anc are not to be inter- 
preted as limiting on the scope thereof. While the examples and detailed description sections of the present 
invention are sufficient to guide anyone of ordinary skill in the art in the practice of the present invention, skilled 

5 artisans are also directed to Molecular Cloning A Laboratory Manual Second Edition. Sambrook.J.. Fritsch. E. 
R. and Mamatis, T., Cold Spring Harbor Press 1989 and Current Protocols in Molecular Biology, Ausubel, F.M.. 
Brent. R.. Kingston, R.E., Moore. D. D.. Seidman. J.G., Smith. J. A. .and Struhl, K..Ed. Greene Publishing Asso- 
ciates and Wiley-lnterscience 1989. The aforementioned resources provide an excellent technical supplement 
to any discourse in genetic engineering. 

■ c The examples provide sources for reagents, however it will be understood that numerous vendors market 

reagents of high quality for use in the protocols and procedures described below and the substitution of re- 
agents or protocols is contemplated by the present invention and embraced in the scoDe thereof. All temper- 
atures unless otherwise noted are expressed in degrees Centigrade. All percentages are on a weight per weight 
basis unless otherwise noted. 

15 

Example 1 

Oligonucleotide synthesis and purification 

The following oligonucleotides were synthesized on an Applied Biosystems (Foster City, CA) model 380B 
20 DNA synthesizer using beta-cyanoethyi phosphoramidite chemistry according to the manufacturer's instruc- 
tions. The single stranded DNA segments were conventionally purified on 12% polyacrylamide-7M urea gels 
and resuspenaed in water. 

25 

fill , (Sequence I.D. 1} (Sequence Length: 77) 

5 ' AGCTTCATATGATCGTTGGCGGCTACACCTGTGGCGCCAATACCGTCCCGTACCAGGTG 
30 TCCCTGAATTCTGGCTAC-3 ' 



&T2 (Sequence I.D 2) (Sequence Length: 77) 

5 ' AGTGGT AGC C AG AATTC AGG G AC ACCTGGT AC GGG AC GGTATTGGCGCC AC AGGTGT AG 
CCGCC AACG ATCATATGA - 2 ' 

(Sequence I.D. 3) (Sequence Length: 81} 

5 ' CACTTCTGTGGTGGCTCCCTCATCAACTCCCAGTGGGTGGTATCAGCG3CCCACTGCTA 
CAAGTCCGGCATCCAGGTGCGT - 3 ' 

ST4A (Sequence I.D. 4) (Sequence Length: 81) 

5 ' CCAGACGCACCTGGATGCCGGACTTGTAGCAGTGGGCCGCTGATACCACCCACTGGGAG 
TTGATGAGGGAGCCACCACAGA-3 ' 

8 
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£Xi 'Sequence I.D. 5) (Sequence Length: 73) 

5 ' CTGGGCGAGGATAACATCAACGTCGTGGAGGGCAATGAGCAGTTCATCTCCGCATCC AA 
GTCCATGGTGCACT - 3 ' 

&TS Sequence I.D. 6) (Sequence Length: 73) 

5 ' CTAGAGTGCACGAT GGACTTGGATGC GG AG ATGAACTGCTCATTGCCCTCCACGACGTT 
GATGTTATCCTCGC- 3 ' 

ET2 (Sequence I.D. 7) (Sequence Length: 84) 

5 ' AGCTTCATCGTGC ACCCGTCCTACAACTCCAACACTCTGAACAATGACATC ATGCTGAT 
C AAGCTCAAGTCCGCCGCATCCCTG - 3 ' 

BX2. (Sequence I.D. 3) (Sequence Length: 84) 

5 ' AGTTCAGGGATGCGGCGGACTTGAGCTTGATCAGCATGATGTCATTGTTCAGAGTGTTG 
GAGTTGTAGGACGGGTGCACGATGA-3 ' 

BT? (Sequence I.D. 9) (Sequence Length: 93) 

5 ' AACTCCCGCGTGGGCTGCATCTCTCTGCCGACCTCCTGTGCCTCCGCCGGCACGCAGTG 
CCTCATCTCTGGCTGGGGCAAG ACTAAGAGCTCT- 3 ' 

&T1Q (Sequence I.D. 10) (Sequence Length: 93) 

5 ' TGCCAGAGCTCTTAGTGTTGCCCCAGCCAGAGATGAGGCACTGCGTGCCGGCGGAGGGA 
C AGGAGGTCGGCAG AGAG ATGGAGGCC ACGCGGG - 3 ' 
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BX11 'Sequence 3 



£ e qijen :e Length: 3 8 



20 



25 



5 ' GGCACCTC ~ AC C _ A3 AC 3T ~3T 3AAGT3 3 3TGAAGG3T3 3TATCCTG AGCG ATTCC7C 
C T GTAAGTC 333 CT A 3 3 C T 3 3 33 AG AT TT - 3 ' 

BUZ (Sequence 1.3. 12; ( Sequence Length: 83) 

5 ' CTAGAAATCTGG3G AGGGT AG 3C GGACTTAC AGGAGGAATCGCTC AGG ATAGGAGCCTT 
C A 3 G C AC T T C AG 3 A 3 3 T C TG 3 3 T A G G AG G - 3 ' 



BT13 (Sequence 
5 ' AGCTTCCTGGC 



V_ ."A J rt * 



. 13 (Sequence Length: 77) 

~ACGAGCAACATGTTCTGTGCCGGCTACCTGGAGGGCGGCAAG 



GATTCCTGCT AGGGT 3 AT - ! 



JO 



35 



BT14 (Sequence I.e. 14) ''Sequence Length: 77 1 

5 ' CAGAATC ACCCTGAC AGG AATCCTTGCCGCCCTCCAGGT AGCCGGC ACAGAACATGTTG 
CTGGTAATCTGGCCAGGA- i ' 



40 



45 



BT15 (Sequence :.C. 15 ? (Sequence Length: ^6) 

5 ' TCTGGTGGCCCTGTGGTCTGCTCCGGCAAGCTCCAAGGCATCGTCTCCTGGGGTTCCGG 
CTGTGCCCAGAAGAAC A - 3 ' 



BT16 (Sequence I. 3. 16) (Sequence Length: ~6) 

5 ' GGCTTGTTCTTCTGGGCACAGCCGGAACCCCAGGAGACGATGCCTTGGAGCTTGCCGGA 
GCAGACCAC AGGGCC AC - 3 ' 
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SX12 (Sequence i.d. 17} (Sequence Length: 74) 

5 ' AGCCTGGCGTCTAC ACCAAGGTCTGT AACTATGTGTCCTGGATTAAGCAGACCATAGCT 
TCC AATTAGGATCCT - 3 ' 



fiTlS (Sequence I.D. 18) (Sequence Length: 74) 

5 ' CTAGAGGATCCTAATTGGAAGCTATGGTCTGCTTAATCCAGGACACATAGTTACAGACC 
TTGGTGTAGACGCCA-3 ' 



20 

BX13_ (Sequence i.d. 19) (Sequence Length: 45) 

5 ' -TATGGTGGATGATGATGATAAGATCGTTGGCGGGTACACCTGTGG - 3 ' 

25 



(Sequence I.D. 20) (Sequence length: 45) 
30 5 ' -CGCCACAGGTGTAGCCGCCAACGATCTTATCATCATCATCCACCA-3 ' 



Example 2 

35 Construction of pRMG1 

A. Preparation of 231 base pair Hindlll-Xbal gene segment 

Six fag of oligonucleotides BT2, BT3A, BT4A. and BT5 were individually phosphorylated in 20 uJ re- 
actions containing 50 mM Tris-HCI {pH 7.8), 10 mM MgCI 2 , 5 mM dithiothreitol, 5% glycerol. 100 uM ade- 
nosine triphosphate, and 20 units T4 polynucleotide kinase (Boehringer Mannheim, Indianapolis, IN) at 

40 3 7°C for 30 min. The kinase was thermally inactivated by heating at 70°C for 5 min. 

Six ng of each of the above phosphorylated oligonucleotides was mixed with 6 ng (6pJ) each of oli- 
gonucleotides BT1 and BT6, heated at 70° C for 5 min. and cooled to room temperature to allow the oli- 
gonucleotides to anneal. The annealed oligonucleotides were then treated with 30 units T4 DNA ligase 
(Boehringer Mannheim, Indianapolis, IN) in a 200 f.il reaction containing 50 mM Tris-HCI (pH 7.8), 10 mM 

45 MgCI 2 . 5 mM dithiothreitol. 5% glycerol, and 100 uM adenosine triphosphate for 1 hour at 20°C then 18 

hours at 15°C. 

The desired 231 base pair DNA fragment was conventionally purified on an 8% polyacrylamide gel 
and resuspended in water. Two ng of the purified DNA fragment was treated with 20 units of T4 polynu- 
cleotide kinase in a 20 j.i1 reaction containing 50 mM Tris-HCI (pH 7.8), 10 mM MgCI 2 , 5 mM dithiothreitol, 
so 5% glycerol, and 100 u,M adenosine triphosphate at 37°C for 30 min. 

B. Preparation of pBluescript SK+ vector 

Twenty ng of plasmid pBluescript SK+ (Stratagene, LaJolla. CA) was digested to completion with 100 
units Hindi II (Boehringer Mannheim, Indianapolis. IN) and 100 units Xba l (Boehringer Mannheim. Indian- 
apolis. IN) in a 250 ^ reaction containing 50 mM Tris-HCI (pH 8.0), 10 mM MgCI 2 , 50 mM NaCI. and 100 
55 (.tg/ml bovine serum albumin at 37°C for one hour. The enzymes were thermally inactivated by heating at 

70°C for 10 min. 

The 5* termini were dephosphorylated by treatment of the DNA with 5 units (5 ul) calf intestinal alkaline 
phosphatase (Boehringer Mannheim, Indianapolis. IN) at 37°C for 30 min. The enzyme was thermally in- 
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activated Dy neating at 70-C f or 1 5 mm. T he solution was extractea with an ecuai volume of phenol equi- 
libratea with -00 mM Tpis-HC! (pH 3.0). The aqueous iayer was recovered and DNA was precipitated by 
the addition of 0. 1 volume 3 M sodium acetate and 2.2 volumes of absolute et hanot. The DNA was collected 
by centnfugation and resuspended in 300 ul water. 

5 C. Final construction of pRMG1 

1.3 ug of the purified 231 base pair fragment prepared in Example 2A and 0.3 ug of the pBluescnpt 
vector DNA prepared in Example 1B were Mgated with 10 units of T4 DNA ligase in a 10 ul reaction con- 
taining 50 mM Tns-HCI (pH 7.8). 10 mM MgCI 2 . 5 mM dithiothreitol . 5% glycerol, and 100 adenosine 
triphosphate at 20'C for 18 hours. 

1 0 A portion of the ligation mixture was used to transform E. coli K12 MM294 cells. Transformants were 

selected on L agar containing 50 ug/ml ampicillin. Ampicillin-resistant transformants containing the desired 
plasmid pRMG1 were identified following plasmid DNA purification by restriction enzyme site analysis and 
nucleotide sequencing. 

15 Example 3 

Construction of pRMG2 

A. Preparation of 265 base pair Hindlll-Xbal gene segment 

Six ug of oligonucleotides BT8, BT9, BT10 t and BT11 were individually phosphorylated in 20 u.l reac- 
20 tions containing 50 mM Tris-HCI (pH 7.8), 10 mM MgCI 2 , 5 mM dithiothreitol. 5% glycerol, 100 uM adeno- 

sine triphosphate, and 20 units T4 polynucleotide kinase (Boehringer Mannheim, Indianapolis. IN) at 37°C 
for 30 min. The kinase was thermally inactivated by heating at 70°C for 5 min. 

Six ug of each of the above phosphorylated oligonucleotides was mixed with 6 ug (6ul) each of oli- 
gonucleotides BT7 and BT12. heated at 70° C for 5 min. and cooled to room temperature to allow the oli- 
25 gonucleotides to anneal. The annealed oligonucleotides were then treated with 30 units T4 DNA ligase 

(Boehringer Mannheim. Indianapolis. IN) in a 200 ul reaction containing 50 mM Tris-HCI (pH 7.8). 10 mM 
MgCI 2 , 5 mM dithiothreitol. 5% glycerol, and 100 uM adenosine triphosphate for 1 hour at 20°C then 18 
hours at 15 C C. 

The desired 265 base pair DNA fragment was conventionally purified on an 8% polyacrylamide gel 
30 and resuspended in water. Two ug of the purified DNA fragment was treated with 20 units of T4 polynu- 

cleotide kinase in a 20 ul reaction containing 50 mM Tris-HCI (pH 7.8), 10 mM MgCI 2 . 5 mM dithiothreitol, 
5% glycerol, and 100 adenosine triphosphate at 37°C for 30 min. 

B. Final construction of pRMG2 

1 .3 ug of the purified 265 base pair fragment prepared in Example 2A and 0.3 ug of the pBluescript 
35 vector DNA prepared in Example 1 B were ligated with 10 units of T4 DNA ligase in a 10 ul reaction con- 

taining 50 mM Tris-HCI (pH 7.8). 10 mM MgCI 2 , 5 mM dithiothreitol. 5% glycerol, and 100 uM adenosine 
triphosphate at 20°C for 18 hours. 

A portion of the ligation mixture was used to transform E.coii K12 MM294 cells. Transformants were 
selected on L agar containing 50 ug/ml ampicillin. Ampicillin-resistant transformants containing the desired 
40 plasmid pRMG2 were identified following plasmid DNA purification by restriction enzyme site analysis and 

nucleotide sequencing. 

Example 4 

45 Construction of pRMG3 

A, Preparation of 227 base pair Hin dlll- Xba l gene segment 

Six ug of oligonucleotides BT14. BT15, BT16, and BT17 were individually phosphorylated in 20 ul re- 
actions containing 50 mM Tris-HCI (pH 7.8), 10 mM MgCI 2 , 5 mM dithiothreitol. 5% glycerol, 100 uM ade- 
nosine triphosphate, and 20 units T4 polynucleotide kinase (Boehringer Mannheim. Indianapolis. IN) at 

50 37°C for 30 min. The kinase was thermally inactivated by heating at 70°C for 5 min. 

Six ug of each of the above phosphorylated oligonucleotides was mixed with 6 ug (6ul) each of oli- 
gonucleotides BT13 and BT18; heated at 70° C for 5 min. and cooled to room temperature to allow the 
oligonucleotides to anneal. The annealed oligonucleotides were then treated with 30 units T4 DNA ligase 
(Boehringer Mannheim, Indianapolis, IN) in a 200 ul reaction containing 50 mM Tris-HCI (pH 7.8). 10 mM 

55 MgCI 2 , 5 mM dithiothreitol. 5% glycerol, and 100 uM adenosine triphosphate for 1 hour at 20°C then 18 

hours at 15°C. 

The desired 227 base pair DNA fragment was conventionally purified on an 8% polyacrylamide gel 
and resuspended in water. Two ug of the purified DNA fragment was treated with 20 units of T4 polynu- 
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cfeotide kinase in a 20 ul reaction containing 50 mM Tris-HCI (pH 7.8). 10 mM MgCl 2 . 5 mM dit h iot hreitoL 

5% glycerol, and 100 uM adenosine triphosphate at 37°C for 30 mm. 

B. Final construction of pRMG3 

1 .3 ug of the purified 227 base pair fragment prepared in Example 3A and 0.3 ug of the pBluescrrpt 
5 vector DNA prepared in Example 2B were ligated with 10 units of T4 DNA ligase in a 10 ul reaction con- 

taining 50 mM Tris-HCI (pH 7.8). 10 mM MgCI2. 5 mM dithiothreitol. 5% glycerol, and 100 uM adenosine 

triphosphate at 20°C for 18 hours. 

A portion of the ligation mixture was used to transform £. coli K1 2 MM294 celfs. Transformants were 

selected on L agar containing 50 ug/mi ampicillin. Ampicillin-resistant transformants containing the desired 
10 plasmid pRMG3 were identified following plasmid DNA purification by restriction enzyme site analysis and 

nucleotide sequencing. 

Example 5 

15 Construction of pRMG4 

A. Preparation of the 218 Base Pair ApaLI-Ndei Restriction Fragment of pRMG1 

Thirty ug of plasmid pRMG1 was digested to completion with 120 units of ApaLI (New England Biolabs. 
Beverly MA) in a 600ul reaction containing 10 mM Tris-HCI (pH 7,5), 10mM MgCI 2 ,and 1mM dithiothreitol. 
and 1 00 ug/ml bovine serum albumin at 37°C for two hours. The enzyme was thermally inactivated by heat- 

20 ing at 70°C for 10 mm. The DNA was digested to completion with Ndel by supplementing the reaction with 

50 mM Tris-Hcl (pH 7.5). 100 mM Nad and 120 units Ndel (Boehringer Mannheim, Indianapolis, IN) in a 
750 ul reaction and incubating at 37 C for two hours. The enzyme was thermally inactivated by heating 
at 70°C for 10 min. The DNA was recovered by ethanol precipitation as described in example 1 B and re- 
suspended in water. The desired 218 base pair fragment was conventionally purified on a 1.5% agarose 

25 gel by electroelution onto DEAE cellulose paper and resuspended in water. 

B. Preparation of the 247 Base Pair Msc l- Apa L1 Restriction Fragment of pRMG2 

Thirty ug of pRMG2 was digested to completion with 75 units (25ul) Msc l (an isoschizomer of Ball. 
New England Biolabs. Beverly, MA) and 120 units (12ul) Apa LI in a 750 ul reaction containing 10 mM Tris- 
HCI (pH 7.5). 10mM MgCI 2 . 1 mM dithiothreitol, and 100 ug/mi bovine serum albumin at 37°C fortwo hours. 
30 The enzymes were thermally inactivated by heating at 70 Q C for 10 mm. The DNA was recovered by ethanol 

precipitation as described in example 1B and resuspended in water. The desired 247 base pair fragment 
was conventionally purified on a 1.5% agarose gel by electroelution onto DEAE cellulose paper and re- 
suspended in water. 

C. Preparation of the 211 Base Pair Mscl- Bam HI Restriction Fragment of pRMG3 

35 Thirty ug pf pRMG3 was digested to completion with 75 units (25 ul) Msc l in a 600 ul reaction con- 

taining 50 mM potassium acetate. 20 mM Tris-acetate (pH 7.9), 10 mM magnesium acetate, 1 mM dithio- 
threitol, and 100 ug/mi bovine serum albumin at 37°C for 2 hours. Tris-acetate is Trizma® acetate 
(Tris[hydroxymethyl]aminomethane acetate) and is available from Sigma Chemical Co., St. Louis, MO 
63187. The enzyme was thermally inactivated at 70°C for 10 min. The DNA was digested to completion 

40 with Bam HI by supplementing the reaction with 50 mM NaCI and 120 units of Bam HI in a 750 ul reaction 

and incubating at 37 C for 2 hours. The enzyme was thermally inactivated by heating at 70°C for 10 min. 
The DNA was recovered by ethanol precipitation as described in example 1B and resuspended in water. 
The desired 211 base pair fragment was conventionally purified on a 1.5% agarose gel by electroelution 
onto DEAE cellulose paper and resuspended in water. 

45 D. Preparation of pHKY390 expression vector 

Twenty ug of plasmid pHKY390 was digested to completion with 240 units Nde l (Boehringer Man- 
nheim, Indianapolis. IN) and 80 units Bam HI (Boehringer Mannheim, Indianapolis, IN) in a 100 microliter 
reaction containing 50mM Tris-HCI (pH 8.0), 10mM MgCI 2 . 100mM NaCI, and 100 ug/ml bovine serum al- 
bumin at 37~C for 1 hr. The enzymes were thermally inactivated by heating at 70°C for 10 min. 

50 The 5' termini were dephosphorylated by treatment of the DNA with 5 units (5 ul) calf intestinal alkaline 

phosphatase (Boehringer Mannheim. Indianapolis, IN) at 37°C for 30 min. The enzyme was thermally in- 
activated by heating at 70°C for 15 min. The solution was extracted with an equal volume of phenol equi- 
iibarted with 100 mM Tris-HCI (pH 8.0). The aqueous layer was recovered and DNA was precipitated by 
the addition of 0. 1 volume 3 M sodium acetate and 2.2 volumes of absolute ethanol. The DNA was collected 

55 by centrifugation and resuspended in 300 ul water. 

E. Final construction of pRMG4 

Two hundred ng of the purified 218 base pair fragment prepared in Example 5A, 200 ng of the purified 
247 base pair fragment purified in Example 5B, 200 ng of the purified 211 base pair fragment purified in 
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ExamDie 5C. and 100 ng of the pHKY39Q vector DN A prepared in Exampie 5D were iigated with 10 units 
of T4 DNA iigase i Boennnger Mannneim. incianapons IN) in a 20 ui reaction containing 50 mM Tris-HCI 
(pH 7.8), 10 mM MgCl 2 . 5 mM dit hiotnreitol. 5% glycerol, and 100 uM adenosine triphosphate at 15-C for 
1 5 hours. 

5 A portion of the ligation mixture was used to transform E. colt K12 MM294 cells. Transformants were 

selected on L agar containing 10 ug/mi tetracycline. Tetracycline resistant transformants containing the 
desired plasmid pRMG4 were identified following plasmid DNA purification by restriction enzyme site ana- 
lysis and nucleotide sequencing of the trypsin gene. 

10 Example 6 

Construction of pRMG5 

A. Preparation of the 225 base pair Apa LI-Hindlll restriction fragment of pRMG1 

Twenty ug of pRMG1 was digested to completion with 80 units of Apa L1 (New England Biolabs, Bev- 
7 5 erly, MA) in a 100 ul reaction containing 10 mM Tris-HCI (pH 7.5). 10 mM MgCI 2 . 1 mM dithiothreitol, and 

1 OOug/ml bovine serum albumin at 37°C for 2 hours. The enzyme was thermally inactivated by heating at 
70°C for 10 min. The DNA was digested to completion with Hindi 1 1 by supplementing the reaction with 50 
mM Tris-HCI (pH 7.5), 50 mM NaCI. and 80 units of Hind III in a 125 ul reaction and incubating at 37°C for 

2 hours. The enzyme was thermally inactivated by heating at 70°C for 10 min. The DNA was recovered 
20 by ethanol precipitation as described in Example 2B and resuspended in water. The desired 225 base pair 

fragment was conventionally purified on a 1.5% agarose gel by electroelution onto DEAE cellulose paper 
and resuspended in water. 

B. Preparation of pRMG3 vector 

Thirty ug of pRMG3 was digested to completion with 75 units of Msc l ( an isoschizomer of Ball , New 
25 England Biolabs, Beverly, MA) in a 600 ul reaction containing 50 mM potassium acetate, 20 mM Tris acet- 

ate, 10 mM magnesium acetate, 1 mM dithiothreitol, and 100 ug/ml bovine serum albumin at 37°C for 2 
hours. The enzyme was thermally inactivated by heating at 70°C for 10 min. The DNA was digested to 
completion with Hind I II by supplementing the reaction with 50 mM Tris-HCI, 50 mM NaCI, and 120 units 
of Hind II I in a 750 ul reaction and incubating at 37°C for 2 hours. The enzyme was thermally inactivated 
30 at 70°C for 10 min. 

The 5' termini were dephosphorylated and the DNA was recovered by ethanol precipitation as descri- 
bed in Example 1B and resuspended in water. 

C. Final construction of pRMG5 

Two hundred ng of the purified 225 base pair fragment prepared in example 5A. 200 ng of the 247 
35 base pair fragment prepared in Example 4B, and 50 ng of the pRMG3 vector DNA prepared in Example 

6B were Iigated with 10 units of T4 DNA Iigase in a 40 vif reaction containing 50 mM Tris-Hcl (pH 7.8), 10 
mM MgCl 2 , 5 mM dithiothreitol, 5% glycerol, and 100 uM adenosine triphosphate at 15 2 C for 15 hours. 

A portion of the ligation mixture was used to transform E. coli K12 MM294 cells. Transformants were 
selected on L agar containing 50 ug/ml ampicillin. Ampicillin resistant transformants containing the desired 
40 plasmid pRMG5 were identified following plasmid DNA purification by restriction enzyme site analysis and 

nucleotide sequencing of the trypsin gene. 

Example 7 

45 Construction of pRMG6 

A. Preparation of the 45 base pair Ndel-Narl segment 

Seven ug of oligonucleotides BT19 and BT20 were individually phosphorylated in 20 ul reactions con- 
taining 50mM Tris-HCI (pH7.8). 10 mM MgCI 2 , 5 mM dithiothreitol, 5% glycerol, 100 uM adenosine triphos- 
phate, and 20 units of T4 polynucleotide kinase (Boehringer Mannheim, Indianapolis, IN) at 37 C C for 30 
so min. The kinase was thermally inactivated by heating at 70°C for 1 0 min. 

The two 20 ul reactions were subsequently mixed, then heated to 70 C C for 5 min. and cooled to room 
temperature to allow the BT19 and BT20 oligonucleotides to anneal. 

B. Preparation of pRMG5 vector 

Twenty ug of pRMG5 was digested to completion with 40 units of Narl (Bethesda Research Labora- 
55 tories, Gaithersburg, MD) in a 100 ul reaction containing 50 mM Tris-HCI (pH 8.0), 10 mM MgCI 2 , and 

1 00ug/ml bovine serum albumin at 37°C for 2 hours. The enzyme was thermally inactivated by heating at 
70°C for 10 min. The DNA was digested to completion with Ndel by supplementing the reaction with 50 
mM NaCI and 80 units of Nde l in a 125 ul reaction and incubating at 37°C for 2 hours. The enzyme was 
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thermally inactivated by heating at 70 : C for 10 min. 

The 5' termini were dephosphorylated and the DNA was recovered by ethanol precipitation as descri- 
bed in Example 1B and resuspended in water. 
C. Final construction of pRMG6 
5 Three hundred and fifty ng of the 45 base pair Narl- Nde l fragment prepared in Example 7 A and 100 

ng of the pRMG5 vector DNA prepared in Example 7B were ligated with 10 units of T4 DNA ligase in a 20 
u.l reaction containing 50 mM Tris-HCI (pH 7.8), 10 mM MgCI 2 , 5 mM dithiothreitol, 5% glycerol, and 100 
u.M adenosine triphosphate at 15-C at 15 hours. 

A portion of the ligation mixture was used to transform E. co//K12 MM294 cells. Transformants were 
10 selected on L agar containing 50 ng/ml ampicillin. Ampicillin-resistant transformants containing the desired 

pRMG6 DNA were identified following plasmid DNA purification by restriction enzyme site analysis and 
nucleotide sequencing of the trypsinogen gene. 

Example 8 

15 

Construction of pRMG7 

A. Preparation of the 695 base pair Bam HI- Nde l trypsinogen gene 

Twenty ng of pfasmid pRMG6 was digested to completion with 36 units of Bam HI (Boehringer Man- 
nheim, Indianapolis, IN) and 20 units of Nde l (New England P" . Beverly, MA) in a 40 u.l reaction con- 
20 taining 50 mM Tris-HCI (pH 8.0). 10 mM MgCI 2 , 100 mM d 100 ng/ml bovine serum albumin at 

37°C for 1 hour. The enzymes were thermally inactivate. ating at 70°C for 10 min. The DNA was 

recovered by ethanol precipitation as described in Example 2B and resuspended in water. 

B. Final construction of pRMG7 

Three hundred and fifty ng of the restricted pRMG6 DNA prepared in Example 8A and 100 ng of the 
25 pHKY390 vector DNA prepared in Example 5C were tigated with 10 units of T4 DNA ligase in a 25 fj reaction 

containing 50 mM Tris-HCI (pH 7 Q > 10 mM MgCI 2 , 5 mM dithiothreitol, 5% glycerol, and 1 00 jaM adenosine 
triphosphate at 15°C for 15 hours. 

A portion of the ligation mixture was used to transform E. coli K12 MM294 cells. Transformants were 
selected on L agar containing 10 ng/ml tetracycline. Tetracycline-resistant transformants containing the 
30 desired pfasmid pRMG7 were identified following plasmid DNA purification by restriction enzyme site ana- 

lysis and nucleotide sequencing of the trypsinogen gene. 

Example 9 

35 Construction of L693/pRMG4 

A. Transformation of L693 with pRMG4 

The E. coli strain L693 was transformed with plasmid pRMG4 DNA from Example 4E. Transformants were 
40 selected on L agar containing 1 0u.g/ml tetracycline. Tetracycline-resistant transformants containing the desired 
plasmid pRMG4 were identified by restriction enzyme site analysis and nucleotide sequencing of the trypsin 
gene. 

Example 1 0 

45 

Construction of L687/pRMG7 

A. Transformation of L687 with pRMG7 

50 The lorfE. coli strain L687 was transformed with plasmid pRMG7 DNA from Example 8B. Transformants 

were selected on L agar containing 10(ag/mi tetracycline. Tetracycline-resistant transformants containing the 
desired pRMG7 were identified by restriction enzyme site analysis and nucleotide sequencing of the trypsino- 
gen gene. 
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20 



25 



30 



35 



45 



50 



<l) GENERAL INFORMATION: 

(i) APPLICANT: ELI LILLY ANT COM FAN"/ 

(3) STREET: Lilly Corporate Center 

CITY: Indianapolis 

STATE : Indiana 

C OUNTR Y : United States of Ame r i c a 



(D 
( E 

(F) ZIP: 46285 



(li) TITLE CF INVENTION: Expression Vectors for Bovine 
'rypsin and Tr/psinoger and Host Cells Transformed Therewith 



(ill) NUM3ER OF SEQUENCES: 24 

(iv) CORRESPONDENCE ALDRE5S : 

(A) AC DRES SEE: C. M. Hudson 

(E) STREET. Erl Wood Manor 

(C) CITY: Wmdlesham 

(D) STATE: Surrey 

(E) COUNTRY: United Kingdom 

(F) ZIP: GU20 6 F r. 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, LO Mb 

storage 
(3) COMPUTER: Macintosh 
(0) OPERATING SYSTEM: Macintosh 
(D) SOFTWARE: Microsoft Word 



55 
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(2) INFORMATION FOR S E 2 1L NO: 1 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESSS : single stranded 

(D) TOPOLOGY: linear 

(ll) MOLECULE TYPE: DMA 
(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 1 
AGCTTCATAT GATCGTTGGC GGCTACACCT GTGGCGCCAA TACCGTCCCG 5 0 
15 TACCAGGTGT CCCTGAATTC TGGCTAC 7 7 



10 



INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 
(3) TYPE: Nucleic acid 

(C) STRANDEDNESSS : single stranded 

(D) TOPOLOGY: linear 

(ll) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

30 

Sequence I - D 2 

AGTGGTAGCC AGAATTC AGO GACACCTCGT ACGGGACGGT ATTGGC G CCA 50 
CAGGTGTAGC CGC 2AACGAT CATATGA 7 7 



(2) 

20 



25 



40 



45 



50 



55 
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15 



20 



25 



30 



35 



40 



50 



f A) LENGTH: SI base pairs 

5 j TV P E : Nucleic a:id 

(C) STRAN'DECNESSS : single stranded 

\ j } TO PO L OG V : linear 

(11) MOLECULE TYPE: DMA 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
Sequence 1.2. 2 ) f £ ^ u - ; . ■.; e Length: 81) 

CACTTCTGTG GTOGCTL OCT _ArC.---.OTCC C AC YGGGTGG TATC AGCGGG 50 
CCACTGCTAC AAGTCCG CCA CCCAGGTGCG T 81 

(2) INFOP>0\TICN "OR SEC 10 NO: 4 

;i) sequence characteristics: 

(A) LENGTH: SI base pairs 

(B) TYPE: Nuclei z acid 

(C) S CR AN 2 E 2 N E S S S : single stranded 
(2) PCPOLOGV; linear 

Yi) MOLECULE TYPE: DNA 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
Sequence 1.2. 4 

CCAGACGCAC C~ 2- 3 7G CC 3 AACTTGTAGC AGTGGGCCGC TGATACC ACC 50 
CACTGGGAGT TGATGAG3G A 2CC ACC ACAG A 81 



55 



18 



EP 0 597 681 A1 



15 



(2) INFORMATION FOP SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

!B) TYPE: Nucleic acid 

(C) STPANDEDNESSS: single stranded 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
CTGGGCGAGG ATAACA r I CAA CGTCGTGGAG GGCAATGAGC AGTTCATCTC 5 0 
CGCATCCAAG TCCATCGTGC ACT 73 



(2) INFORMATION FOR SEQ ID NO: 6 

SEQUENCE CHARACTERISTICS : 

(A) LENGTH: ^3 bdse pairs 

(B) TYPE: Nucleic acid 

(C) STRAMDEDNESS5 : double stranded 

(D) TOPOLOGY : linear 

(11) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 



(l) 

25 



35 CTAGAGTGCA CGATGGACTT GGATGCGGAG ATGAACTGCT CATTGCCCTC 5 0 
CACGACGTTG ATGTTATCCT CGC 73 



(8) INFORMATION FOP SEQ ID NO: 7 



45 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 84 base pairs 

(B) TYPE: Nucleic acid 

<C) STRANDEDNESSS: single stranded 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: DNA 



55 
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(2) INFORMATION FOP. SE~ ID MO: 6 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: Nucleic acid 

(C) 3TRANOEDNE33S : single stranded 

(D) TOPOLOGY : 1 1 near 

( 11 } MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IC NO: 8 



AGTTCAGGGA TGCO"GGAC TTGAGCTTGA TCAGCATGAT GTCATTGTTC 
AGAGTGTTGG AGTTGTAGGA CCGGTGCACG ATGA 
(2) INFORMATION FOR SEQ ID NO: 9 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 1 base pairs 

(B) TYPE : Nucleic Acid 

(C) STRANDEDNES £ £ : single stranded 

(D) TOPOLOGY: linear 

(n) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 

AACTCCCGCG TSGZCTCSAT ITCTCTGCCG ACCTCCTGTG CCTCCGCCGG 
CACGCAGTGC CTCATCTCTG GCTGGGGCAA OACTAAGAGC TCT 



(2) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 3 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESSS : single stranded 

(D) TOPOLOGY: linear 



20 



EP 0 597 681 A1 



(11) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

CGCCAGAGCT CTTAGTGTTG CCCCAGCCAG AGATGAGGCA CTGCGTG 
GCGGAGGCAC AGGAGGTC GG CAGAGAGATG GAGGCCACGC GGG 
(2) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESSS: single stranded 

(D) TOPOLOGY: Linear 

(n) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ I D NO : 11 
GGCACCTCCT ACCC A 3ACGT GCTGAAGTGC CTGAAGGCTC CTATCCTGAG 
CGATTCCTCC TGTAAG TCCG CCTACCCTGG CCAGATTT 



[2) INFORMATIDN F PR SEQ ID NO: 12 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : S3 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESSS : single stranded 

(D) TOFDLCGY: linear 

(nj MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 



C TAG AAATC T GGCC AGGGTA GGCGGACTTA CAGGAGGAAT CGCTCAGGAT 
AGGAGCCTTC AGGCACTTCA GCACGTCTGG GTAGGAGG 

(2) INFORMATION FOR SEQ ID NO: 13 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 7 7 base pairs 
(3) TYPE: Nucleic acid 
(C) STRANDEDNESSS : single stranded 
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[ xi 



TION: SEQ I D NC 



ro 



35 



45 



50 



AGCTTCCTGG CIASATTACC A3CAACATC. _____ ^^^^ 
GGCGGC AAGG ATTCCTGCTA GGGTGAT 



(2) INFORMATION FOR SEQ I D NO : 14 



(l) SEQUENCE GHARACTE? I STIC 5 : 

(A) LENGTH: 1 6 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESSS : single stranded 

(D) TOPOLOGY: linear 

40 fll) MOPE IN LE TYPE : PNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 



50 
75 



( I ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: " base pairs 
(3) TYPE: Nucleic acid 

20 (C) 3 TP. AND E P N E S S S : single stranded 

\D< TCPIUGGY: linear 

(II) MOLECULE TYPE : ON A 

25 (XI) SEQUENCE JESCEIPTION: SEQ ID NO: 14 

C AG AATC AC C CTGACAGGAA TCCTTGCCGC CCTCCAGGTA GCCGGCACAG 5 0 
30 AACATGTTGC TGGTA^ATCTG COO AGO A 77 

( 2 ) IN FORMAT I IN EC P 3 EQ Z O NO : 1 5 



TCTGGTGGCC CTGTG3TCTG CTCCGGCAA3 CTCCAAGGCA TCGTCTCCTG 5 0 
GGGTTCCGGC TGT3CCCAGA AGAACA 7 6 

(2} INFORMATION FOR SEQ ID NO: 16 
(l) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 7 6 base pairs 

(3) TYPE: Nucleic acid 

(C) STRAND EDNESSS : single stranded 

(3) TOPOLOGY: linear 

(ll) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 



GGCTTGTTCT TCTGGGCAC A GCCGGAACCC CAGGAGACGA TGCCTTGGAG 5 0 
CTTGCCGGAG CAGACC ACAG GGCCAC 7 6 



(2) INFORMATION FOR SEQ ID NO: 17 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 4 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESSS : single stranded 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 



AGCCTGGCGT CTACACCAAG GTCTGTAACT ATGTGTCCTG GATTAAGCAG 5 0 
ACCATAGCTT CCAATTAGGA TCCT 7 4 
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inforiiaticn f:r seq id no: is 

1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 7 4 base pair: 
'3) TYPE: 



^ * - 



■D) i CP -LOGY: Linear 
Ci) MOLECULE TV PR : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

is CTAGAGGATC CTAATTGGAA GCTATGC-TCT GCTTAATCCA GGACACATAG 5 0 
TTACAGACCT TGGTGTAGAC GCCA 7 4 



(2) INFGPLMATIDN FOR SEQ I D NC: 19 

(I) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 45 base pairs 

(B) TYRE: Nucleic acid 

(C) STRANT-'ECNESSS : single stranded 

(D) TOPOLOGY: linear 

(II) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ I D NO : 19 

TATGGTGGAT GATGATGATA AGATCGTTGG CGGCTACACC TGTGG 4 5 

(2) INFORMATION FOR SEQ I D NC : 2 0 

(I', SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESSS : single siranded 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 



CGCCAC AGGT GTAGCC3CCA ACGATCTTAT CATCATCATC CACCA 4 5 
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INFORMATION FOR SEQ ID NO: 21 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 683 base pairs 

(B) TYPE: Nucleic acid 

(C) STPANDEDNESSS ; double stranded 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 



CAT ATG ATC GTT GGC GGC TAC ACC TGT GGC GCC AAT ACC GTC CCG 4 5 

Men lie Val Gly Gly Tyr Thr Cys Gly Ala Asn Thr Val Pro 14 

TAC CA'J GTC TCC :TG AAT TCT GGC TAC CAC TTC TGT GGT GGC TCC 9 0 

Tyr Gin Val .^er Leu Am Ser Gly Tyr His me Cys Gly Gly Ger 2 9 

GTC AT J AAC !"C: :AG TGG GTG IT A TCA GCG GCC CAC TGC TAC AAG 13 5 

Leu He Asn Ser J 1 r. Trc Val Val Ser Ala Ala Hi 3 Cys Tyr Lys 44 

TCC ( GG C AT: GAG GTG CGT CTG 'GGC GAG GAT AAC ATC AAG GTC GTG 180 

Ger Gly Tl^ Gin 7*1 Arg Leu Gly Glu Asp Asn He Asn Val Val 59 

GAG GCC AAT GAG :AG TTC ATC TCC GCA TCC AAG TCC ATC GTG CAC 22 5 

Glu Gly Asn Slu G I r. rho He Ser Ala Ger Lys Ser He Val His 7 4 

CCG TCC TAC AA1 TCC AAC ACT :TS AAC AAT GAC ATC ATC CTG ATC 2^0 

Pro Sei Tyr Am 3-er Asn Thr Leu Asn Asn Asp He Met Leu He 89 

35 AAG CTC AAG TCC GCC GCA TCC CTG AAC TCC CGC GTG GCC TCC ATC 315 

Lys Leu Lys Ser Ala Ala Ser Leu Asn Ser Arg Val Ala Ser He 104 

TCT CTG CCG kZ Z TCC TCT GCC TCC GCC GGC ACG CAG TCC CTC ATC 3 60 

Ser Leu Pro Thr ~-r Cys Ala Ser Ala Gly Thr Gin Cys Leu He 119 

40 TCT GGC TGS GGC AAC ACT AAG AGC TCT 3GC ACC TCC TAC CCA GAC 405 

Ser Giy Trp Sly Vsn Thr Lys Ser Ser Sly Thr Ser Tyr Pro Asp 134 

GTG CTG AA S TGC :TG AAG SCT ZZT ATC TTC AGC GAT TCC TCC TGT 450 

Val Leu Lys lys l^eu Lys Ala Pro lie Leu Ser Asp Ser Ser Cys 149 

45 AAG TCC GCC TAT TCT GGC SAG ATT ACC AG: AAC ATS TTC TGT GCC 495 

Lys Ser A^a Tyr Pre Giy Sin He Thr Ser Asn Met Phe Cys Ala 164 

GGC TAC CTG GAG GGC GGC AAG GAT TCC TGT CAG GGT GAT TCT GGT 5 40 

Gly Tyr Leu SI- Gly Gly Lys Asp Ser Cys Sin Gly Asp Ser Gly 179 

50 GGC CCT GTG STC TGC TCC GGC AAG CTC C AA GGC ATC GTC TCC TGG 58 5 

Gly Pro Val 1 Cys Ser Gly Lys Leu Gin Gly He Val Ser Trp 194 
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INFORMATION FOR SEQ ZD NO: 21 

( i ) SEQUENCE -CHARACTERISTICS : 

(A) LENGTH: 701 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANOEDNESSS : single stranded 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 

(xi) SEQUENCE SESCR1FTION: £EQ ID NO: 23 

CAT ATG GTG G AT GAT GAT CAT AAG ATC GTT GGC GGC TAC ACC TGT 4 5 
Met Val Asp Asp Asp Asp Lys lie Val Gly Gly Tyr Thr Cys 14 

GGC GCC AAT ACC GTC CCG TAC CAG GTG TCC CTG AAT TCT GGC TAC 9 0 
20 Gly Ala Asn .hr Val Pre Tyr Gin Val Ser Leu Asn Ser Gly Tyr 29 

CAC TTC TGT GGT GGC TCC CTC ATC AAC TCC CAG TGG GTG GCA TCA H5 
His Phe Cys Zly Gly Ser Lej He Asn Ser Gin Trp Val Val Ser 44 

25 GCG GCC 7G: TAC AAC CCS GGC ATC C AG J1 V G CGT CTG GGC GAG 180 

': Tyr Lys Cor Gly He Gin Val Arg L^u Gly Glu 59 
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Ala Ala Hi: 



GAT .AAC ATC AAC G.C GTG GAG GGC AAT GAG GAG TTC ATC TCC CCA 22 5 

Asp Asn He Asr. Val Val Glu Gly Asn Glu Gin Phe Tie Ser Ala 74 

TCC AAG TCC ATC GTC CAC CCG TCC TAC AAC TCC AAC ACT CTG AAC 27 0 

Ser Lys Ser He Val His Pro Ser Tyr Asn Ser Asn Thr Leu Asn 89 

AAT CAC AT: ATG CTG ATC AAG CTC AAG TCC GCC GCA TCC CTG AAC U5 

Asn Asp IH Me: Leu He Lys Leu Lys Ser Ma Aid Ser Leu Asn 104 

TCC CGC GTG 3C : TCC ATC TCT CTC CCG ACC TCC TGT GCC TCC GCC 360 

Ser Arg Val Ala Ser He Ser Leu Pro Thr Ser Cys Ala Ser Ala 119 

GGC ACG C AG TCC CTC ATC TCT GGC TGG GGC AAC ACT AAG AGC TCT 405 

Gly Thr Gin Cys Leu lie Ser Gly Trp Gly Asn Thr Lys Ser Ser 134 

GGC ACC TCC TAC CCA GAC GTG CTG AAG TGC CTG AAG GCT CCT ATC 4 50 

Gly Thr Ser Tyr Pro Asp Val Leu Lys Cys Leu Lys Ala Pro He 149 

CTG AGC GAT TC C TCC TGT AAG TCC GCC TAC CCT GGC CAG ATT ACC 49 5 

Leu Ser Asp Ser Ser Cys Lys Ser Ala Tyr Pro Gly Gin He Thr 164 

AGC AAC ATG TTC TGT GCC GGC TAC CTG GAG GGC GGC AAG GAT TCC 540 

Ser Asn Mec Phe Cys Ala Gly Tyr Leu Glu Gly Gly Lys Asp Ser 179 

TGT CAG GGT GAT TCT GGT GGC CCT GTG GTC TCC TCC GGC AAG CTC 5 85 

Cys Gin Gly Asp Ser Gly Gly Pro Val Val Cys Ser Gly Lys Leu 19 4 

CAA GGC ATC GTC TCC TGG GGT TCC GGC TGT GCC CAG AAG AAC AAG 63 0 
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_ ... , _ ' " j-vj-l- * ,\ - jTG TC _ TItO ATT A.AG 6~ 5 

Trp He Lys 22 4 

CAG ACC ATA OVT TTC AAT TAGGATCC 

Gir, Thr !!e a:.i Asn - " * 



(2) IMFCRMATICN* TOR 5E~ 1G MO: 2 4 

SEQUENCE CHARACTERISTICS: 
< A ) LENGTH : 2 } D base cairs 
(3) TYPE: protein 

(CI 3TRANTEDNESS.S : g merle stranded 
(C) TCPDLCCY: linear 

(ID MORECURE TYPE: protein 

(XI) SFQRFNCE RESCR: FTICN : 3EQ IF NC: 24 

Met VM Asp Asp Asp Asp Lys lie V d ; Gly Gly Tyr Thr Cys Gly 15 

Ala A - - Thr V*_ ?:-r- T/l G_:i Val Se: Leu Asn Ser Sly Tyr His 30 

' ' ' Ji; ' ^ r L ^' J A5:i 2 = - -*^n Trp Val Val Ser Ala 45 

Ala His Cys Cy i Lys Sly Tie Gin Val Arg Leu Gly Giu Asp 60 

Asn Aai:i Vjl1 Val 5 n Glu Gin Phe lie Ser Ala Ser 75 

ivys Ser He Val His Fro Ser Tyr Asn Ser Asa Thr Leu Asn Asn 9C 

Asp lie Me: Leu He Lys Leu Lys Ser Ala Ala Ser Leu Asn Ser 105 

35 krg Ala £er :le p ro Thr Ser Cys Ala Ser Ala Gly 12C 

Thr Gl:i Cys. Leu lie -,er Gly Trp Gly Asn Thr Lys Ser Ser Gly 135 

Thr Ser Tyr r— Asp Val Leu Lys Cys Leu Lys Ala Pro lie Leu 150 

40 Ser Asp Ser ^ r C V S -V' s Ser Ala Tyr Pro Gly Gin lie Thr Ser 165 



Asn Me: Phe Cys Ala Gly Tyr Leu Glu Gly Gly Lys Asp Ser Cys 
Gin Gly Asp Ser Gly Gly Pro Val Val Cys Ser Gly Lys Leu Gin 



45 Gly He Val 



Gly Val Tyr .hi Lys V d 1 Cy z Asn Tyr Vol Ger Trp He Lys Gin 
Th r lie Aid Ser A n 



180 
195 



Sei Tip Gly £ er Gly Cy3 Ma Gln Lys ^ s p ^ 2 ^ 
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55 Claims 

1. A recombinant DNA expression vector comprising the DNA sequence of Sequence I.D. 21 . 
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2. The vector of claim 1 that is plasmid pRMG4. 

3. A recombinant DNA expression vector comprising the DNA sequence of Sequence t.D. 23. 

4. The vector of claim 3 that is plasmid pRMG7. 

5. Amethod of producing bovine trypsin comprising culturing a host cell transformed with the vector of claim 
1 under conditions appropriate for production of bovine trypsin. 

6. The method of claim 5 wherein said host cell is a Ion- host cell. 

7. A method of producing bovine trypsmogen comprising culturing a host cell transformed with the vector 
of claim 3 under conditions appropriate for production of bovine trypsinogen. 

8. The method of claim 7 wherein said vector is plasmid pRMG7. 

9. The method of claim 7 wherein said host cell is a Ion- host cell. 



10. Amethod of producing bovine trypsin comprising 

(a) culturing a host cell transformed with the vector of claim 3 under conditions appropriate for produc- 
20 tion of bovine trypsinogen 

(b) recovering the trypsinogen from step (a) and 

(c) enzymatically converting the trypsinogen to trypsin 



11. Amethod for converting human proinsulin to human insulin comprising treating human proinsulin with bio- 
synthetically produced trypsin. 
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FIG. I 
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FIG. 3 
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